Picture for Xiaochi Wei

Xiaochi Wei

UnityMAS-O: A General RL Optimization Framework for LLM-Based Multi-Agent Systems

Add code
May 26, 2026
Viaarxiv icon

Tournament-GRPO: Group-Wise Tournament Rewards for Reinforcement Learning in Open-Ended Long-Form Generation

Add code
May 26, 2026
Viaarxiv icon

Knowledge-Graph Paths as Intermediate Supervision for Self-Evolving Search Agents

Add code
May 07, 2026
Viaarxiv icon

PRAISE: Prefix-Based Rollout Reuse in Agentic Search Training

Add code
Apr 04, 2026
Viaarxiv icon

JADE: Bridging the Strategic-Operational Gap in Dynamic Agentic RAG

Add code
Jan 29, 2026
Viaarxiv icon

Self-Compression of Chain-of-Thought via Multi-Agent Reinforcement Learning

Add code
Jan 29, 2026
Viaarxiv icon

Efficient Thought Space Exploration through Strategic Intervention

Add code
Nov 13, 2025
Viaarxiv icon

AdaSwitch: Adaptive Switching between Small and Large Agents for Effective Cloud-Local Collaborative Learning

Add code
Oct 17, 2024
Figure 1 for AdaSwitch: Adaptive Switching between Small and Large Agents for Effective Cloud-Local Collaborative Learning
Figure 2 for AdaSwitch: Adaptive Switching between Small and Large Agents for Effective Cloud-Local Collaborative Learning
Figure 3 for AdaSwitch: Adaptive Switching between Small and Large Agents for Effective Cloud-Local Collaborative Learning
Figure 4 for AdaSwitch: Adaptive Switching between Small and Large Agents for Effective Cloud-Local Collaborative Learning
Viaarxiv icon

From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions

Add code
Oct 10, 2024
Figure 1 for From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions
Figure 2 for From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions
Figure 3 for From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions
Figure 4 for From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions
Viaarxiv icon

LLMs + Persona-Plug = Personalized LLMs

Add code
Sep 18, 2024
Figure 1 for LLMs + Persona-Plug = Personalized LLMs
Figure 2 for LLMs + Persona-Plug = Personalized LLMs
Figure 3 for LLMs + Persona-Plug = Personalized LLMs
Figure 4 for LLMs + Persona-Plug = Personalized LLMs
Viaarxiv icon